Microsoft Bing Team Open Sources Harrier Multilingual Embedding Model
Microsoft's Bing team has open-sourced Harrier, a text embedding model that supports more than 100 languages and performs strongly on the MTEB v2 benchmark. The model has 2.7 billion parameters and a 32,000-token context window, and was trained on 2 billion examples and synthetic data generated by GPT-5. Together, these significantly improve accuracy and flexibility on multilingual tasks.